Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 8839 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1018.6 KiB |
| Average record size in memory | 118.0 B |
Variable types
| Categorical | 1 |
|---|---|
| Boolean | 6 |
| Text | 1 |
| Numeric | 9 |
| DateTime | 2 |
type has constant value "True" | Constant |
label is highly imbalanced (73.4%) | Imbalance |
site_admin is highly imbalanced (94.8%) | Imbalance |
public_repos is highly skewed (γ1 = 30.97124933) | Skewed |
public_gists is highly skewed (γ1 = 57.42774864) | Skewed |
following is highly skewed (γ1 = 31.14344449) | Skewed |
created_at has unique values | Unique |
public_repos has 203 (2.3%) zeros | Zeros |
public_gists has 2527 (28.6%) zeros | Zeros |
followers has 233 (2.6%) zeros | Zeros |
following has 1516 (17.2%) zeros | Zeros |
text_bot_count has 8513 (96.3%) zeros | Zeros |
log_public_repos has 203 (2.3%) zeros | Zeros |
log_public_gists has 2527 (28.6%) zeros | Zeros |
log_followers has 233 (2.6%) zeros | Zeros |
log_following has 1516 (17.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-26 05:04:15.930361 |
|---|---|
| Analysis finished | 2024-11-26 05:04:25.547959 |
| Duration | 9.62 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
label
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.1 KiB |
| Human | |
|---|---|
| Bot | 401 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.9092658 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Human |
|---|---|
| 2nd row | Human |
| 3rd row | Human |
| 4th row | Human |
| 5th row | Human |
Common Values
| Value | Count | Frequency (%) |
| Human | 8438 | |
| Bot | 401 | 4.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| human | 8438 | |
| bot | 401 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 8438 | |
| u | 8438 | |
| m | 8438 | |
| a | 8438 | |
| n | 8438 | |
| B | 401 | 0.9% |
| o | 401 | 0.9% |
| t | 401 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43393 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| H | 8438 | |
| u | 8438 | |
| m | 8438 | |
| a | 8438 | |
| n | 8438 | |
| B | 401 | 0.9% |
| o | 401 | 0.9% |
| t | 401 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43393 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| H | 8438 | |
| u | 8438 | |
| m | 8438 | |
| a | 8438 | |
| n | 8438 | |
| B | 401 | 0.9% |
| o | 401 | 0.9% |
| t | 401 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43393 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| H | 8438 | |
| u | 8438 | |
| m | 8438 | |
| a | 8438 | |
| n | 8438 | |
| B | 401 | 0.9% |
| o | 401 | 0.9% |
| t | 401 | 0.9% |
| Value | Count | Frequency (%) |
| True | 8839 |
site_admin
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 77.7 KiB |
| False | |
|---|---|
| True | 52 |
| Value | Count | Frequency (%) |
| False | 8787 | |
| True | 52 | 0.6% |
| Value | Count | Frequency (%) |
| True | 6168 | |
| False | 2671 |
| Value | Count | Frequency (%) |
| True | 5553 | |
| False | 3286 |
| Value | Count | Frequency (%) |
| True | 7312 | |
| False | 1527 | 17.3% |
| Value | Count | Frequency (%) |
| False | 6632 | |
| True | 2207 | 25.0% |
bio
Text
| Distinct | 8641 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.1 KiB |
Length
| Max length | 160 |
|---|---|
| Median length | 116 |
| Mean length | 61.460459 |
| Min length | 1 |
Unique
| Unique | 8574 ? |
|---|---|
| Unique (%) | 97.0% |
Sample
| 1st row | I just press the buttons randomly, and the program evolves... |
|---|---|
| 2nd row | Time is unimportant, only life important. |
| 3rd row | Done studying. Need challenges. |
| 4th row | Administrator of MOONGIFT that is introducing open source software everyday to Japanese engineers since 2004. |
| 5th row | Senior Software Engineer at Google, working on Certificate Transparency and generalized transparency. |
| Value | Count | Frequency (%) |
| 3069 | 3.9% | |
| and | 2526 | 3.2% |
| engineer | 1583 | 2.0% |
| software | 1521 | 1.9% |
| of | 1488 | 1.9% |
| at | 1380 | 1.8% |
| developer | 1236 | 1.6% |
| the | 1086 | 1.4% |
| a | 1038 | 1.3% |
| i | 1033 | 1.3% |
| Other values (14754) | 62407 |
Most occurring characters
| Value | Count | Frequency (%) |
| 70014 | 12.9% | |
| e | 49589 | 9.1% |
| o | 32360 | 6.0% |
| n | 31402 | 5.8% |
| a | 31366 | 5.8% |
| t | 31195 | 5.7% |
| r | 31181 | 5.7% |
| i | 28526 | 5.3% |
| s | 19655 | 3.6% |
| l | 14767 | 2.7% |
| Other values (1736) | 203194 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 543249 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 70014 | 12.9% | |
| e | 49589 | 9.1% |
| o | 32360 | 6.0% |
| n | 31402 | 5.8% |
| a | 31366 | 5.8% |
| t | 31195 | 5.7% |
| r | 31181 | 5.7% |
| i | 28526 | 5.3% |
| s | 19655 | 3.6% |
| l | 14767 | 2.7% |
| Other values (1736) | 203194 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 543249 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 70014 | 12.9% | |
| e | 49589 | 9.1% |
| o | 32360 | 6.0% |
| n | 31402 | 5.8% |
| a | 31366 | 5.8% |
| t | 31195 | 5.7% |
| r | 31181 | 5.7% |
| i | 28526 | 5.3% |
| s | 19655 | 3.6% |
| l | 14767 | 2.7% |
| Other values (1736) | 203194 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 543249 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 70014 | 12.9% | |
| e | 49589 | 9.1% |
| o | 32360 | 6.0% |
| n | 31402 | 5.8% |
| a | 31366 | 5.8% |
| t | 31195 | 5.7% |
| r | 31181 | 5.7% |
| i | 28526 | 5.3% |
| s | 19655 | 3.6% |
| l | 14767 | 2.7% |
| Other values (1736) | 203194 |
public_repos
Real number (ℝ)
Skewed  Zeros 
| Distinct | 594 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.67112 |
| Minimum | 0 |
|---|---|
| Maximum | 26360 |
| Zeros | 203 |
| Zeros (%) | 2.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 22 |
| median | 55 |
| Q3 | 114 |
| 95-th percentile | 319 |
| Maximum | 26360 |
| Range | 26360 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 588.34113 |
|---|---|
| Coefficient of variation (CV) | 5.086327 |
| Kurtosis | 1112.9756 |
| Mean | 115.67112 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 30.971249 |
| Sum | 1022417 |
| Variance | 346145.28 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 203 | 2.3% |
| 1 | 123 | 1.4% |
| 15 | 109 | 1.2% |
| 14 | 106 | 1.2% |
| 11 | 99 | 1.1% |
| 19 | 98 | 1.1% |
| 7 | 98 | 1.1% |
| 24 | 96 | 1.1% |
| 6 | 96 | 1.1% |
| 8 | 95 | 1.1% |
| Other values (584) | 7716 |
| Value | Count | Frequency (%) |
| 0 | 203 | |
| 1 | 123 | |
| 2 | 89 | |
| 3 | 80 | 0.9% |
| 4 | 92 | |
| 5 | 81 | 0.9% |
| 6 | 96 | |
| 7 | 98 | |
| 8 | 95 | |
| 9 | 91 |
| Value | Count | Frequency (%) |
| 26360 | 1 | |
| 22618 | 1 | |
| 20693 | 1 | |
| 17425 | 1 | |
| 16985 | 1 | |
| 16839 | 1 | |
| 9666 | 1 | |
| 9554 | 1 | |
| 6344 | 1 | |
| 6079 | 1 |
public_gists
Real number (ℝ)
Skewed  Zeros 
| Distinct | 305 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.434551 |
| Minimum | 0 |
|---|---|
| Maximum | 55781 |
| Zeros | 2527 |
| Zeros (%) | 28.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 17 |
| 95-th percentile | 90 |
| Maximum | 55781 |
| Range | 55781 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 881.83821 |
|---|---|
| Coefficient of variation (CV) | 22.943892 |
| Kurtosis | 3452.1685 |
| Mean | 38.434551 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 57.427749 |
| Sum | 339723 |
| Variance | 777638.63 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2527 | |
| 1 | 801 | 9.1% |
| 2 | 545 | 6.2% |
| 3 | 390 | 4.4% |
| 4 | 324 | 3.7% |
| 5 | 318 | 3.6% |
| 6 | 277 | 3.1% |
| 7 | 202 | 2.3% |
| 9 | 194 | 2.2% |
| 8 | 169 | 1.9% |
| Other values (295) | 3092 |
| Value | Count | Frequency (%) |
| 0 | 2527 | |
| 1 | 801 | 9.1% |
| 2 | 545 | 6.2% |
| 3 | 390 | 4.4% |
| 4 | 324 | 3.7% |
| 5 | 318 | 3.6% |
| 6 | 277 | 3.1% |
| 7 | 202 | 2.3% |
| 8 | 169 | 1.9% |
| 9 | 194 | 2.2% |
| Value | Count | Frequency (%) |
| 55781 | 1 | |
| 53660 | 1 | |
| 26879 | 1 | |
| 10604 | 1 | |
| 3450 | 1 | |
| 1750 | 1 | |
| 1679 | 1 | |
| 1611 | 1 | |
| 1513 | 1 | |
| 1337 | 1 |
followers
Real number (ℝ)
Zeros 
| Distinct | 1405 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 380.20251 |
| Minimum | 0 |
|---|---|
| Maximum | 58452 |
| Zeros | 233 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 22 |
| median | 73 |
| Q3 | 235 |
| 95-th percentile | 1470.9 |
| Maximum | 58452 |
| Range | 58452 |
| Interquartile range (IQR) | 213 |
Descriptive statistics
| Standard deviation | 1522.4168 |
|---|---|
| Coefficient of variation (CV) | 4.004226 |
| Kurtosis | 362.87009 |
| Mean | 380.20251 |
| Median Absolute Deviation (MAD) | 64 |
| Skewness | 15.059094 |
| Sum | 3360610 |
| Variance | 2317752.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 233 | 2.6% |
| 1 | 146 | 1.7% |
| 2 | 128 | 1.4% |
| 4 | 118 | 1.3% |
| 3 | 116 | 1.3% |
| 9 | 111 | 1.3% |
| 6 | 110 | 1.2% |
| 7 | 104 | 1.2% |
| 5 | 98 | 1.1% |
| 16 | 97 | 1.1% |
| Other values (1395) | 7578 |
| Value | Count | Frequency (%) |
| 0 | 233 | |
| 1 | 146 | |
| 2 | 128 | |
| 3 | 116 | |
| 4 | 118 | |
| 5 | 98 | |
| 6 | 110 | |
| 7 | 104 | |
| 8 | 92 | 1.0% |
| 9 | 111 |
| Value | Count | Frequency (%) |
| 58452 | 1 | |
| 31120 | 1 | |
| 29719 | 1 | |
| 29414 | 1 | |
| 28411 | 1 | |
| 25815 | 1 | |
| 24893 | 1 | |
| 22707 | 1 | |
| 22520 | 1 | |
| 21057 | 1 |
following
Real number (ℝ)
Skewed  Zeros 
| Distinct | 555 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.67847 |
| Minimum | 0 |
|---|---|
| Maximum | 27775 |
| Zeros | 1516 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 12 |
| Q3 | 44 |
| 95-th percentile | 242.1 |
| Maximum | 27775 |
| Range | 27775 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 508.36665 |
|---|---|
| Coefficient of variation (CV) | 6.8074057 |
| Kurtosis | 1321.4193 |
| Mean | 74.67847 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 31.143444 |
| Sum | 660083 |
| Variance | 258436.65 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1516 | 17.2% |
| 1 | 581 | 6.6% |
| 2 | 398 | 4.5% |
| 3 | 299 | 3.4% |
| 4 | 272 | 3.1% |
| 5 | 238 | 2.7% |
| 6 | 229 | 2.6% |
| 8 | 191 | 2.2% |
| 7 | 179 | 2.0% |
| 9 | 168 | 1.9% |
| Other values (545) | 4768 |
| Value | Count | Frequency (%) |
| 0 | 1516 | |
| 1 | 581 | 6.6% |
| 2 | 398 | 4.5% |
| 3 | 299 | 3.4% |
| 4 | 272 | 3.1% |
| 5 | 238 | 2.7% |
| 6 | 229 | 2.6% |
| 7 | 179 | 2.0% |
| 8 | 191 | 2.2% |
| 9 | 168 | 1.9% |
| Value | Count | Frequency (%) |
| 27775 | 1 | |
| 16741 | 1 | |
| 15931 | 1 | |
| 10268 | 1 | |
| 9720 | 1 | |
| 9686 | 1 | |
| 9532 | 1 | |
| 9367 | 1 | |
| 7374 | 1 | |
| 5879 | 1 |
created_at
Date
Unique 
| Distinct | 8839 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.1 KiB |
| Minimum | 2008-01-27 07:09:47+00:00 |
|---|---|
| Maximum | 2021-12-05 22:58:37+00:00 |
updated_at
Date
| Distinct | 8805 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.1 KiB |
| Minimum | 2018-08-06 22:55:54+00:00 |
|---|---|
| Maximum | 2023-10-14 14:33:48+00:00 |
text_bot_count
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.068786062 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 8513 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.39259064 |
|---|---|
| Coefficient of variation (CV) | 5.7074156 |
| Kurtosis | 50.063152 |
| Mean | 0.068786062 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.6884455 |
| Sum | 608 |
| Variance | 0.15412741 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8513 | |
| 1 | 137 | 1.5% |
| 2 | 114 | 1.3% |
| 3 | 62 | 0.7% |
| 4 | 8 | 0.1% |
| 5 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8513 | |
| 1 | 137 | 1.5% |
| 2 | 114 | 1.3% |
| 3 | 62 | 0.7% |
| 4 | 8 | 0.1% |
| 5 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 5 | 5 | 0.1% |
| 4 | 8 | 0.1% |
| 3 | 62 | 0.7% |
| 2 | 114 | 1.3% |
| 1 | 137 | 1.5% |
| 0 | 8513 |
log_public_repos
Real number (ℝ)
Zeros 
| Distinct | 594 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8694923 |
| Minimum | 0 |
|---|---|
| Maximum | 10.179641 |
| Zeros | 203 |
| Zeros (%) | 2.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.3862944 |
| Q1 | 3.1354942 |
| median | 4.0253517 |
| Q3 | 4.7449321 |
| 95-th percentile | 5.768321 |
| Maximum | 10.179641 |
| Range | 10.179641 |
| Interquartile range (IQR) | 1.6094379 |
Descriptive statistics
| Standard deviation | 1.3322031 |
|---|---|
| Coefficient of variation (CV) | 0.34428369 |
| Kurtosis | 1.0391104 |
| Mean | 3.8694923 |
| Median Absolute Deviation (MAD) | 0.78683266 |
| Skewness | -0.52674041 |
| Sum | 34202.442 |
| Variance | 1.774765 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 203 | 2.3% |
| 0.6931471806 | 123 | 1.4% |
| 2.772588722 | 109 | 1.2% |
| 2.708050201 | 106 | 1.2% |
| 2.48490665 | 99 | 1.1% |
| 2.995732274 | 98 | 1.1% |
| 2.079441542 | 98 | 1.1% |
| 3.218875825 | 96 | 1.1% |
| 1.945910149 | 96 | 1.1% |
| 2.197224577 | 95 | 1.1% |
| Other values (584) | 7716 |
| Value | Count | Frequency (%) |
| 0 | 203 | |
| 0.6931471806 | 123 | |
| 1.098612289 | 89 | |
| 1.386294361 | 80 | 0.9% |
| 1.609437912 | 92 | |
| 1.791759469 | 81 | 0.9% |
| 1.945910149 | 96 | |
| 2.079441542 | 98 | |
| 2.197224577 | 95 | |
| 2.302585093 | 91 |
| Value | Count | Frequency (%) |
| 10.17964092 | 1 | |
| 10.02654554 | 1 | |
| 9.937599082 | 1 | |
| 9.765718623 | 1 | |
| 9.740144754 | 1 | |
| 9.731512288 | 1 | |
| 9.176473302 | 1 | |
| 9.164819857 | 1 | |
| 8.75542238 | 1 | |
| 8.712759975 | 1 |
log_public_gists
Real number (ℝ)
Zeros 
| Distinct | 305 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7520114 |
| Minimum | 0 |
|---|---|
| Maximum | 10.929207 |
| Zeros | 2527 |
| Zeros (%) | 28.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1.6094379 |
| Q3 | 2.8903718 |
| 95-th percentile | 4.5108595 |
| Maximum | 10.929207 |
| Range | 10.929207 |
| Interquartile range (IQR) | 2.8903718 |
Descriptive statistics
| Standard deviation | 1.5616003 |
|---|---|
| Coefficient of variation (CV) | 0.89131858 |
| Kurtosis | -0.14700818 |
| Mean | 1.7520114 |
| Median Absolute Deviation (MAD) | 1.4816045 |
| Skewness | 0.61543332 |
| Sum | 15486.029 |
| Variance | 2.4385955 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2527 | |
| 0.6931471806 | 801 | 9.1% |
| 1.098612289 | 545 | 6.2% |
| 1.386294361 | 390 | 4.4% |
| 1.609437912 | 324 | 3.7% |
| 1.791759469 | 318 | 3.6% |
| 1.945910149 | 277 | 3.1% |
| 2.079441542 | 202 | 2.3% |
| 2.302585093 | 194 | 2.2% |
| 2.197224577 | 169 | 1.9% |
| Other values (295) | 3092 |
| Value | Count | Frequency (%) |
| 0 | 2527 | |
| 0.6931471806 | 801 | 9.1% |
| 1.098612289 | 545 | 6.2% |
| 1.386294361 | 390 | 4.4% |
| 1.609437912 | 324 | 3.7% |
| 1.791759469 | 318 | 3.6% |
| 1.945910149 | 277 | 3.1% |
| 2.079441542 | 202 | 2.3% |
| 2.197224577 | 169 | 1.9% |
| 2.302585093 | 194 | 2.2% |
| Value | Count | Frequency (%) |
| 10.92920652 | 1 | |
| 10.89044176 | 1 | |
| 10.19913779 | 1 | |
| 9.269080867 | 1 | |
| 8.146419323 | 1 | |
| 7.467942332 | 1 | |
| 7.426549072 | 1 | |
| 7.385230923 | 1 | |
| 7.322510434 | 1 | |
| 7.198931241 | 1 |
log_followers
Real number (ℝ)
Zeros 
| Distinct | 1405 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2764705 |
| Minimum | 0 |
|---|---|
| Maximum | 10.975978 |
| Zeros | 233 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.0986123 |
| Q1 | 3.1354942 |
| median | 4.3040651 |
| Q3 | 5.4638318 |
| 95-th percentile | 7.2943019 |
| Maximum | 10.975978 |
| Range | 10.975978 |
| Interquartile range (IQR) | 2.3283376 |
Descriptive statistics
| Standard deviation | 1.824143 |
|---|---|
| Coefficient of variation (CV) | 0.42655339 |
| Kurtosis | 0.043933647 |
| Mean | 4.2764705 |
| Median Absolute Deviation (MAD) | 1.1685709 |
| Skewness | -0.023421046 |
| Sum | 37799.723 |
| Variance | 3.3274977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 233 | 2.6% |
| 0.6931471806 | 146 | 1.7% |
| 1.098612289 | 128 | 1.4% |
| 1.609437912 | 118 | 1.3% |
| 1.386294361 | 116 | 1.3% |
| 2.302585093 | 111 | 1.3% |
| 1.945910149 | 110 | 1.2% |
| 2.079441542 | 104 | 1.2% |
| 1.791759469 | 98 | 1.1% |
| 2.833213344 | 97 | 1.1% |
| Other values (1395) | 7578 |
| Value | Count | Frequency (%) |
| 0 | 233 | |
| 0.6931471806 | 146 | |
| 1.098612289 | 128 | |
| 1.386294361 | 116 | |
| 1.609437912 | 118 | |
| 1.791759469 | 98 | |
| 1.945910149 | 110 | |
| 2.079441542 | 104 | |
| 2.197224577 | 92 | 1.0% |
| 2.302585093 | 111 |
| Value | Count | Frequency (%) |
| 10.97597829 | 1 | |
| 10.34563811 | 1 | |
| 10.2995755 | 1 | |
| 10.28926003 | 1 | |
| 10.25456687 | 1 | |
| 10.15874973 | 1 | |
| 10.12238209 | 1 | |
| 10.03047256 | 1 | |
| 10.02220349 | 1 | |
| 9.955035814 | 1 |
log_following
Real number (ℝ)
Zeros 
| Distinct | 555 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5342293 |
| Minimum | 0 |
|---|---|
| Maximum | 10.231928 |
| Zeros | 1516 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.0986123 |
| median | 2.5649494 |
| Q3 | 3.8066625 |
| 95-th percentile | 5.4934721 |
| Maximum | 10.231928 |
| Range | 10.231928 |
| Interquartile range (IQR) | 2.7080502 |
Descriptive statistics
| Standard deviation | 1.7952236 |
|---|---|
| Coefficient of variation (CV) | 0.70839036 |
| Kurtosis | -0.50414565 |
| Mean | 2.5342293 |
| Median Absolute Deviation (MAD) | 1.4423838 |
| Skewness | 0.27163164 |
| Sum | 22400.053 |
| Variance | 3.2228277 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1516 | 17.2% |
| 0.6931471806 | 581 | 6.6% |
| 1.098612289 | 398 | 4.5% |
| 1.386294361 | 299 | 3.4% |
| 1.609437912 | 272 | 3.1% |
| 1.791759469 | 238 | 2.7% |
| 1.945910149 | 229 | 2.6% |
| 2.197224577 | 191 | 2.2% |
| 2.079441542 | 179 | 2.0% |
| 2.302585093 | 168 | 1.9% |
| Other values (545) | 4768 |
| Value | Count | Frequency (%) |
| 0 | 1516 | |
| 0.6931471806 | 581 | 6.6% |
| 1.098612289 | 398 | 4.5% |
| 1.386294361 | 299 | 3.4% |
| 1.609437912 | 272 | 3.1% |
| 1.791759469 | 238 | 2.7% |
| 1.945910149 | 229 | 2.6% |
| 2.079441542 | 179 | 2.0% |
| 2.197224577 | 191 | 2.2% |
| 2.302585093 | 168 | 1.9% |
| Value | Count | Frequency (%) |
| 10.23192762 | 1 | |
| 9.725675811 | 1 | |
| 9.676084944 | 1 | |
| 9.236884927 | 1 | |
| 9.182043773 | 1 | |
| 9.178540059 | 1 | |
| 9.162514742 | 1 | |
| 9.145054905 | 1 | |
| 8.905851181 | 1 | |
| 8.679312041 | 1 |
Interactions
Missing values
Sample
| label | type | site_admin | company | blog | location | hireable | bio | public_repos | public_gists | followers | following | created_at | updated_at | text_bot_count | log_public_repos | log_public_gists | log_followers | log_following | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Human | True | False | False | True | False | True | I just press the buttons randomly, and the program evolves... | 30 | 3 | 9 | 6 | 2015-06-29 10:12:46+00:00 | 2023-10-07 06:26:14+00:00 | 0 | 3.433987 | 1.386294 | 2.302585 | 1.945910 |
| 2 | Human | True | False | True | True | True | True | Time is unimportant,\nonly life important. | 103 | 49 | 1212 | 221 | 2008-08-29 16:20:03+00:00 | 2023-10-02 02:11:21+00:00 | 0 | 4.644391 | 3.912023 | 7.100852 | 5.402677 |
| 5 | Human | True | False | True | True | True | False | Done studying. Need challenges. | 56 | 1 | 22 | 7 | 2017-04-11 14:08:07+00:00 | 2023-10-11 05:59:26+00:00 | 0 | 4.043051 | 0.693147 | 3.135494 | 2.079442 |
| 6 | Human | True | False | True | True | True | True | Administrator of MOONGIFT that is introducing open source software everyday to Japanese engineers since 2004. | 277 | 1139 | 63 | 16 | 2008-04-07 22:22:22+00:00 | 2023-09-27 09:04:56+00:00 | 0 | 5.627621 | 7.038784 | 4.158883 | 2.833213 |
| 7 | Human | True | False | True | False | True | False | Senior Software Engineer at Google, working on Certificate Transparency and generalized transparency. | 37 | 1 | 22 | 0 | 2012-01-19 21:57:07+00:00 | 2023-08-07 16:06:34+00:00 | 0 | 3.637586 | 0.693147 | 3.135494 | 0.000000 |
| 9 | Human | True | False | True | True | True | False | Hi | 42 | 9 | 14 | 2 | 2013-07-23 23:29:34+00:00 | 2023-10-09 20:47:05+00:00 | 0 | 3.761200 | 2.302585 | 2.708050 | 1.098612 |
| 10 | Human | True | False | True | False | True | False | \n Software Engineer\n | 42 | 13 | 13 | 26 | 2016-04-05 07:29:09+00:00 | 2023-10-05 11:27:42+00:00 | 0 | 3.761200 | 2.639057 | 2.639057 | 3.295837 |
| 11 | Human | True | False | True | False | False | False | Senior Staff SWE on Open Source Security @ Google.\n\nFounder of the OSV.dev project, co-founder of OSS-Fuzz. | 47 | 2 | 208 | 4 | 2011-04-29 14:08:17+00:00 | 2023-10-14 02:56:25+00:00 | 0 | 3.871201 | 1.098612 | 5.342334 | 1.609438 |
| 12 | Human | True | False | True | True | True | True | 👋 • Developer enjoying Cloud Infrastructure and Artificial Intelligence. Mathematics Student at Paris-Saclay | 20 | 0 | 22 | 32 | 2017-06-27 19:04:38+00:00 | 2023-09-22 12:01:52+00:00 | 0 | 3.044522 | 0.000000 | 3.135494 | 3.496508 |
| 13 | Human | True | False | True | True | True | True | 👉Web Dev Freelance | 17 | 2 | 34 | 26 | 2018-01-02 17:08:06+00:00 | 2023-09-27 06:39:00+00:00 | 0 | 2.890372 | 1.098612 | 3.555348 | 3.295837 |
| label | type | site_admin | company | blog | location | hireable | bio | public_repos | public_gists | followers | following | created_at | updated_at | text_bot_count | log_public_repos | log_public_gists | log_followers | log_following | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19732 | Bot | True | False | False | True | True | False | 🏗️ 👷♂️ | 19 | 0 | 18 | 29 | 2015-02-21 19:10:54+00:00 | 2023-09-22 18:10:36+00:00 | 0 | 2.995732 | 0.000000 | 2.944439 | 3.401197 |
| 19737 | Human | True | False | True | False | False | False | Building the next generation data integration protocol! | 0 | 0 | 18 | 1 | 2021-09-09 13:33:01+00:00 | 2023-10-07 12:18:34+00:00 | 0 | 0.000000 | 0.000000 | 2.944439 | 0.693147 |
| 19740 | Human | True | False | False | False | True | False | Step by step for last success. | 28 | 0 | 187 | 2 | 2013-04-27 06:59:03+00:00 | 2023-10-07 11:22:16+00:00 | 0 | 3.367296 | 0.000000 | 5.236442 | 1.098612 |
| 19747 | Human | True | False | False | False | False | False | physics | 32 | 0 | 37 | 1 | 2013-07-10 16:55:44+00:00 | 2023-10-14 10:32:32+00:00 | 0 | 3.496508 | 0.000000 | 3.637586 | 0.693147 |
| 19749 | Human | True | False | True | True | True | False | Senior Software Engineer, WWW and beyond! | 68 | 10 | 42 | 27 | 2011-07-31 03:41:02+00:00 | 2023-10-10 14:27:48+00:00 | 2 | 4.234107 | 2.397895 | 3.761200 | 3.332205 |
| 19751 | Human | True | False | False | True | True | True | Linux Kernel developer at Qualcomm Innovation Center. Alum of Purdue University. | 32 | 20 | 21 | 4 | 2013-09-15 02:41:10+00:00 | 2023-03-13 02:43:39+00:00 | 0 | 3.496508 | 3.044522 | 3.091042 | 1.609438 |
| 19754 | Bot | True | False | True | False | True | False | Software engineer @intel | 6 | 3 | 2 | 0 | 2018-05-31 02:26:59+00:00 | 2023-09-29 09:45:07+00:00 | 0 | 1.945910 | 1.386294 | 1.098612 | 0.000000 |
| 19760 | Bot | True | False | False | False | False | False | I am the bot account of @alvaroaleman | 1 | 0 | 0 | 0 | 2018-12-15 19:55:31+00:00 | 2021-07-27 14:14:25+00:00 | 2 | 0.693147 | 0.000000 | 0.000000 | 0.000000 |
| 19763 | Bot | True | False | True | True | True | False | Tony came to Linux in 1994 and has never looked back. His entire professional career has been spent working with or on Linux. First as a systems administrator | 36 | 16 | 11 | 4 | 2014-07-02 23:27:34+00:00 | 2023-08-15 16:38:34+00:00 | 0 | 3.610918 | 2.833213 | 2.484907 | 1.609438 |
| 19765 | Human | True | False | True | False | True | False | Software engineer at RealTracs. | 13 | 0 | 10 | 1 | 2015-11-14 14:44:05+00:00 | 2022-08-23 21:09:49+00:00 | 0 | 2.639057 | 0.000000 | 2.397895 | 0.693147 |